Approximating difference evaluations with local knowledge
نویسندگان
چکیده
Difference evaluation functions have resulted in excellent multiagent behavior in many domains, including air traffic control and distributed sensor network control. In addition to empirical evidence, there is theoretical evidence that suggests difference evaluation functions help shape private agent utilities/objectives in order to promote coordination on a system-wide level. However, calculating difference evaluation functions requires global knowledge about the system state and joint action as well as the mathematical form of the system objective function, which are often unavailable. In this work, we demonstrate that a local estimate of the system evaluation function may be used to locally compute difference evaluations, allowing for difference evaluations to be computed in multiagent systems where only local state and action information as well as a broadcast value of the system evaluation function are available. We demonstrate that approximating difference evaluation functions results in better performance and faster learning than when using global evaluation functions, and performs only slightly worse than when directly computing difference evaluations.
منابع مشابه
Approximating Difference Evaluations with Local Information
Difference evaluations can effectively shape agent feedback in multiagent learning systems, and have provided excellent results in a variety of domains, including air traffic control and distributed sensor network control. In addition to empirical evidence, there is theoretical evidence demonstrating how difference evaluations help shape agent utilities/objectives in order to promote system-wid...
متن کاملLocal Approximation of Difference Evaluation Functions
Difference evaluation functions have resulted in excellent multiagent behavior in many domains, including air traffic and mobile robot control. However, calculating difference evaluation functions requires determining the value of a counterfactual system objective function, which is often difficult when the system objective function is unknown or global state and action information is unavailab...
متن کاملConvergence, Consistency and Stability in Fuzzy Differential Equations
In this paper, we consider First-order fuzzy differential equations with initial value conditions. The convergence, consistency and stability of difference method for approximating the solution of fuzzy differential equations involving generalized H-differentiability, are studied. Then the local truncation error is defined and sufficient conditions for convergence, consistency and stability of ...
متن کاملLocally Weighted Least Squares Temporal Difference Learning
This paper introduces locally weighted temporal difference learning for evaluation of a class of policies whose value function is nonlinear in the state. Least squares temporal difference learning is used for training local models according to a distance metric in state-space. Empirical evaluations are reported demonstrating learning performance on a number of strongly non-linear value function...
متن کاملConvergence of a first order scheme for a non local eikonal equation ∗
We prove the convergence of a first order finite difference scheme approximating a non local eikonal Hamilton-Jacobi equation. The non local character of the problem makes the scheme not monotone in general. However, by using in a convenient manner the convergence result for monotone scheme of Crandall Lions, we obtain the same bound |∆X| + ∆t for the rate of convergence.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014